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Carey (1978) has estimated that children learn an average of nine new words a day between 
the ages of eighteen months and six years as their vocabulary expands from 0 to 14,000 words. 
Nagy and H^nan (1987) estimate that this rate of lexical growth continues on through the high 
school years, with an average annual rate of growth of 3000 words, or 8 words per day. While 
estimating vocabulary size is an imprecise enterprise, even the most consm^ative estimates suggest 
that vocabulary acquisition is an incredibly rapid process. 

How do children acquire such impressive vocabularies? During the preschool years, 
children are not generally given direct instruction in word meaning and use, but acquire new words 
through incidental exposure in everyday situations. In fact, a single exposure to a new word in 
naturalistic conversations has been shown to be a sufficient means for even a very young child to 
acquire some understanding of this new term (Carey, 1978, 1982; Carey & Bartlett, 1978; 
Dickinson, 1984; DoUaghan, 1985). In one hearing of the word, the child is able to map an initial 
but incomplete semantic representation onto the spoken word, a pixx^ess Carey has called fast 
mapping. Subsequent exposure to the word will add to this representation, allowing for a more 
complete mapping of the meaning onto the word (see Rice, 1990 for a proposed model of this 
process). Apparently, this mapping process (or some version of it) from words in context - either 
spoken or written - continues through the school years as children acquire and hone the ability to 
read and write. Even in high school children receive direct instruction on only a few hundred of 
the 3(XX) new words they leam each year. 

This ability of young children to benefit from exposure to new words in context raises the 
question of whether or not different kinds of conversational contexts may provide different levels 
of exposure. In this paper, we address two questions: ( 1) do some types of conversation provide 
more exposure to new vocabulary than others?, and (2) what is the relationship between exposure 
to and use of new or rare words in different situations during the preschool years and later 
vocabulary related measures? 

In order to address these questions, we undertook a study of vocabulary found in a variety 
of conversations that took place within families of preschool-aged children, conversations which 
had been previously recorded as part of a larger study. In this paper, we examine the set of words 
used in these conversations and identify what we call rare words from these lists. We explore the 
frequencies of rare words in different conversational settings: at family mealtimes, during book 
readings, while mothers and children play with toys, and during a report of a recent past event told 
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by mothers and childreni. After establishing these frequencies, we then return to the individual 
transcripts to track the use of the rare words by mothers and children and relate those results to 
vocabulary test scores and the ability to give definitions of a set of common words when the 
children were five years oldL 

The Study 

The spoken words used for this study are drawn from transcripts of conversations collected 
from 82 families (with 84 target children - two families had twins, both of whom are in the study). 
These families are all participants in the Home-School Study of Language and Literacy 
Development, a longitudinal study of the social precursors to language developmoit in low- 
income, English-speaking families. This project, which has been funded by the Ford Foundation, 
the Spencer Foundation, and the Etepartment of Health and Human Services, has been collecting 
data annually since 1988 both in the children's homes and in their classrooms* The co-principal 
investigators of the project are Catherine E. Snow of the Harvard Graduate School of Education 
and David K. Dickinson of the Education Department, Clark University (Snow, 1991; Snow, 
Dickinson, & Tabors, 1989). 

The children in the study come from a range of cultural, racial, and economic backgrounds. 
Fifty-five of the children (65 .5 percent) are white, twenty-three (27.4 percent) are African- 
American, six (7.1 pa-cent) are of Hispanic heritage. There are 41 males and 43 females. 
Although the families in the study are considered to be low-income on the basis of the child's 
eligibility for Head Start or other similar preschool programs, they, in fact, represent a wide range 
of social and economic situations^ About half of the mothers (48.8 percent) have reported that they 
graduated frx)m high school and have not pursued further education. A quarter of the mothers 
(25.0 percent) have reported that they did not firush high school, while the remaining quarter (26.2 
percent) have reported tiiat they have received some post-high school education, usually some 
vocational training. Half of tiie families reported welfare as their primary source of income. 

Family configurations also vary widely among the families, A total of 28 families 
consisted of one parent (mother) and h^ child or children at tiie beginning of the study. Thirty- 
nine families reported two adults, usually the fatiier, step-father, or another adult male in the 
household. Eleven other families reported the presence of more than two adults living in the home. 
For example, two families consisted of tiie motiier, tiie target child, and tiie motiier's parents. 

Etata collection 

As part of the study, the families were visited at home when the children were three years 
old and again when they were four. During these visits each mother was asked do a variety of 
tasks witii h^ child. First, she was asked to read two books to her child, Tlie Very Hungry 
Caterpillar, by Eric Carle, and the child's favorite book (If the child did not have a favorite, the 
mother chose one, and if finding an appropriate book was a problem, tiie home visitor provided 
one.) Next, she was asked to elicit a report from the child of some interesting or exciting event 
which had happened recentiy. And finally, tiie mother and child were asked to play witn a set of 
toys that the home visitors provided. All of the conversations which occurred during these tasks 
were tape recorded. At the end of the visit, the mothers were given a blank tape and recorder and 
asked to record a family mealtime conversation. The recorders and tapes were later retrieved by tiie 
home visitors. 



1 For the purposes of this paper, we will be addressing conversational contexts 
exclusively, altiiough it is understood tiiat tiiere are otiier possible contexts as well, such as video 
(Rice, 1990) or written texts. 
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Developing the word lists 

All of the conversations v^'hich were recorded during the home visits were transcribed into 
computer files according to the Codes for the Human Analysis of Transcripts (CHAT) conv^tions 
for analysis by the Child Language Analysis (CLAN) software available through the Child 
Language Data Exchange System (CHILDES) (MacWhinney & Snow, 1990), Using a CLAN 
progi-am called FREQ (for frequency), word lists were generated for each of the conversational 
contexts. The results of this analysis are displayed in Table 1. 

As can be seen in Table 1, a total of 153 visits were made to the fiamiUes in the two years 
resulting in 150 elicited reports, 151 of each of the book readings, 153 toy plays, but only 107 
mealtimes. The varying numbers are a result of the feet that young children do not always 
cooperate with the best-laid plans of researchers, and that collecting the mealtimes from the families 
proved a difficult task for the home visitors. Each of the tasks resulted in a different quantity of 
talk, indicated by the number of tokens (total number of words) used. The elicited report, for 
instance, was a difficult task for the three- and four-year-olds as they had to first remember and 
then structure their talk about a past event with their mothers' help; this task resulted in the least 
amount of talk. The Very H ungrv Caterp illar is a relatively short book and therefore generated less 
talk than the variety of books which were chosen as a second book to read2. The toy play sessions 
generated the most words among the tasks which were done during the home visits. During toy 
play, mothea-s and children chatted while they manipulated the toys, often discussing fantasy world 
scenarios as they played. Unlike the book readings and eUcited reports, toy play lasted as long as 
fifteen minutes, and it was up to the home visitors to terminate the task. And, finally, the family 
mealtimes generated the highest number of tokens (despite the lower number of transcripts), being 
the most naturalistic and least time-ctMistrained of the tasks, averaging about 20 minutes in length, 
and involving all of the members of the family, not just the mother-child dyad. The total corpus 
for all of the speakers for all of the tasks combined from both visits amounted to 527,482 spoken 
words. 

Table 1 

Types and Tokens of Words Found in Transcripts By Conversational 
Context (Age-3 and Age-4 Visits Combined) 

Conversational 
Context 

Elicited Reports 
Book Readings (VHC) 
Book Readings (Choice) 
Toy Play 
Meal times 

Overall 

Table 1 also indicates tliat the tasks generated different numbers of types (the total number 
of different words). For instance, although the eUcited reports generated a Uttle more than half as 
many tokens as the reading of The Verv Hunerv Caterp illar, there were half again as many types in 
the elicited reports. This is most likely explained by the fact that both the text and the discussions 



Number of 






Transcripts 


Tokens 


Types 


150 


39,412 


2299 


151 


72,798 


1433 


151 


107,433 


5068 


153 


152,914 


3349 


107 


154,925 


4803 


712 


527,482 


8456 



- For the purposes of this analysis the text of the books was included as spoken words in 
the corpus. 
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concerning the text of The Very Hungry Caterpillar were highly redundant, whereas the elicited 
reports covered many topics ranging from a trip to an amusement park to what happened at school 
that day, generating, therefore, a more diverse set of words. In the same way, although the 
number of tokeais for toy play and mealtimes are similar, toy play, with its more constrained set of 
circumstances, i.e., a single set of toys, generated far fewer types than mealtimes, which involved 
wide-ranging conversations among femUy members. Finally, while the number for the overall 
tokens represmts all of the tokens added togetlia* from the different convmational contexts, the 
figure for overall types is smaller than the total of all types in the sq)arate categories, since many of 
the types are found in multiple settings. 

Having generated word lists for each of the conversational contexts, it was necessary to 
develop a procedure which would help us define a certain number of the words represented in each 
corpus as "rare vocabulary," i.e. words which would be unusual for a preschooler to have been 
exposed to previously. Our decision was to use a proc^ure which would extract the most 
comm(Mi words from the corpus, leaving a list of uncommon words, and then follow a furtha* set 
of procedures to remove uncommon words wWch were, in fact, merely oddities, leaving, at last, a 
residual list of rare words. 

We used as our list of common words the list developed by Jeanne Chall and Edgar Dale of 
words known by 4th graders (in press). In its original form this list contained 3,000 uninflected 
words. After adding inflections and removing proper nouns, we arrived at a list of 7,881 words. 
Each of the original word lists was again submitted to the FRJBQ pit)gram, this time removing any 
words found on the common words list The resulting lists contained two types of uncommon 
words: rare words (like "arboretum," "bureaucratic," and "carbohydrates") and oddities (like 
"cowabunga," "brang,** and "tummy"). Each of the uncommon word lists was then edited to 
remove the following forms: all proper nouns; exclamations, conversational markers, and 
expletives; forms of address; child reduced forms; slang, dialectical forms, and incorrect forms; 
and child culture terms. These final lists v/ecc considered to contain words which would be 
unusual or rare in the lexicon of a three- or four-year-old. 

Correlational analysis 

Once the lists w^e generated, we obtained each mother and child*s frequency for use of 
lare words in each of the conversational contexts. These frequencies v/ecc then used in a 
correlational analysis with two vocabulary measures which were administered to the children when 
they were five years old. The first of these was die Peabody Picture Vocabulary Test (PPVT-R) 
(Dunn & Dunn, 1981), a common measure of a child's receptive vocabulary. The child's raw 
score on the PPVT was the outcome measure on this task. 

The second measure was one designed by Snow (1990) to assess a child's ability to give a 
formal definition. In this task the child was asked to give definitions of 14 nouns. These 
definitions were scored on a variety of dimensions including whether or not they were formal 
definitions, which require a superordinate category witii a relative clause (e.g., "a thief is a person 
who steals"). The definitions score is an indicator of the child's knowledge of a word meaning 
and his ability to express that meaning to someone else. 

Results 

We report the results in three sections: 1) exposure to rare words across contexts; 2) use 
of rare words by mothers and children; and 3) associations between rare word use and other 
vocabulary measures. 

Exposure to rare, words across situations 

In answer to the fiurst of our two questions, we found that the different conversational 
contexts did, indeed, display different levels of exposure to rare words. Table 2 contains the 
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frequencies of types of words found in each list (the original or all words list, the uncommon 
words list and the rare words list) and the percent of rare words (the number of rare words divided 
by the total words in the original list) for each conversational context 

Table 2 

Total Types of All, Uncommon, and Rare Words and Percent of Rare Words By 
Conversational Context 



Conversational 


All 


Uncommon 


Rare 


Percent of 


Context 


Words 


Words 


Words 


Rare Words 


Book Readings (VHC) 


1433 


271 


143 


10.0% 


Elicited Reports 


2299 


710 


269 


11.7% 


Toy Play 


3349 


1113 


555 


16.6% 


Mealtimes 


4803 


1976 


1086 


22.6% 


Book Readings (Choice) 


5068 


1819 


1309 


25.8% 


Overall 


8456 


4232 


2601 


30.8% 



The variations in the proportion of ra^e words as shown in Table 2 indicates that the 
diffo'ent conversational contexts provide varying levels of new or rare vocabulary input. At the 
high end, the reading of the book of choice provides the largest proportion of rare words, while 
reading The Very Hungry Caterpillar represents the lov/est. As mentioned previously, both the 
text and the discussions of the text of The Very Hungry Caterp illar were highly i^undant; this fact 
influences not only the number of types, but also the relatively low proportion of rare words. The 
book of choice, on the other hand, represents a wide range of material, during which many 
vocabulary items are introduced which fall into our rare category. (Of course, each child was 
exposed to only one book, not all of the different ones.) 

The everyday conversation found in mealtimes also provided a large amount of vocabulary 
input. This is where the most talk took place among all the conversations we collected. The sheer 
yolunie of talk resulted in the possibility of exposure to many new words. But it appears that it 
isn't simply that more talk means more rare words. Explanations and narratives within mealtime 
conversations, which were coded and processed separately, accounted for 673 of the 1086 rare 
words (62 percent) found in mealtimes. It would appear that these specialized types of discourse 
provide a particularly im.portant source of vocabulary input for the children. 

As Table 2 demonstrates, toy play generated a lower proportion of rare words than did 
mealtimes and reading the book of choice. In toy play situations, the conversation necessarily 
converged around the particular set of toys which had been provided by the home visitors. 
However, the new words were not limited to the names of items m the toy bag; the fantasy play 
that mothers and children created around these toys allowed them to broaden the vocabulary they 
used. 

Elicited reports presented a relatively limited source of new words for children. In the 
elicited reports, the task itself was so challenging for the young children, that mothers might well 
have chosen to reduce the vocabulary load during this task. 

Finally, as in Table 1, the overall number of types for the all lists is smaller than the sum of 
the individual all lists, due to the overlap in words across lists. The fact that the proportion of rare 
words in the overall corpus is 30.8 percent indicates that common words are overlapping to a high 
degree across the conversational contexts, but that the rare words are overlapping to a lesser extent. 
In fact, 77 percent of the i^are words appear in only one of the contexts. 
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Use of rare words by mothers and children 

In order to begin to answer the second of our questions concerning the relationship 
between exposure to rare words and later vocabulary measures, we examined the use of rare words 
by individual target children and their mothers during the three-year-old and four-year-old visits. 
We consider the use of rare words by the moiher as an measure of the child's exposure to new 
words. A child's use of rare words could have been a repetition or discussion of a word used by 
the mother (or book), or it could have been an established word in the child's vocabulary. In the 
former case, we see an instance of the child's uptake of a new word; in the latter, we see the 
child's present vocabulary knowledge. Our analysis does not allow us to distinguish these cases. 
The results are displayed in Table 3. 

Table 3 

Means and Ranges of Frequencies of Types of Rare Words Used by Children and Mothers 
at Age'3 and Age-4 Visits by Conversational Context 



ChUd Mother 



Conversational 
Context 


Age 3 
Visit 


Age 4 
Visit 


Age 3 
Visit 


Age 4 
Visit 


Elicited Reports 


0.87 


0.89 


1.70 


1.63 




(0-8) 


(0-6) 


(0-7) 


(0-15) 


Toy Play 


1.36 


2.33 


4.89 


5.44 




(0-9) 


(0-8) 


(0-21) 


(0-23) 


Book Readings 


1.00 


1.19 


4.66 


4.17 


(VHC) 


(0-5) 


(0-6) 


(0-13) 


(0-10) 


Book Readings 


0.68 


1.01 


11.93 


13.83 


(Choice) 


(0-4) 


(0-10) 


(0-67) 


(0-68) 


Mealtimes 


3.06 


4.28 


7.69 


9.78 


(All) 


(0-18) 


(0-17) 


(0-44) 


(0-35) 


Explanatory 


0.64 


1.07 


2.63 


3.27 


Talk 


(0-8) 


(0-6) 


(0-21) 


(0-18) 


Narrative 


0.70 


0.63 


1.84 


1.84 


Talk 


(0-13) 


(0-6) 


(0-15) 


(0-16) 



First, not surprisingly, the means are far smaller and the ranges far less extensive for the 
children than for the mothers. Unlike many results related to conversation in which mothers and 
children mirror each others' usage, these results point to the wide differential in vocabulary use 
between adults and young children. Young children simply don't know as many words. Second, 
there are no obvious developmental differences in the two visits; in fact, there is considerable 
consistency across years within conversational contexts. Third, as has been identified before, the 
use of rare words is context specific; for instance, the lowest for both tlie mothers and children 
comes in the elicited reports and in the narratives at mealtimes (strikingly similar circumstances), 
while the highest, for the mothers, comes in the book of choice in which the text is clearly 
providing much of the new vocabulary input Finally, the means and ranges for both children and 
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mothers demonstrate that the use of rai^ words is not normally distributed in this population, but is 
severely skewed to the left, meaning that only a few of the mothers and children account for the 
upper range of rare word use. 

Associations between rare word use and later vocabulary measures 

The corrdational analysis between mother's and child's individual rare word use and the 
vocabulary measures administ^ed individually when the children were five years old shows that 
thCTe are relationships between exposure to and use of rare words in different conversational 
contexts and later achievement on vocabulary measures. Table 4 presents the results for mothers* 
and children*s rare wcwrd use and the outcomes on the PPVT; Table 5 presents the results for the 
mothers' and children's rare word use and the outcomes on the definitions task. 

Table 4 

Correlations between Children's and Mothers' Rare Word Use at Age-3 and Age-4 Visits an i 
Children's Scores on the PPVT at Age 5 (Pearson r, jj <,01) 

Child Mother 
Conversational Age 3 Age 4 Age 3 Age 4 

Context Visit Visit Visit Visit 



Elicited Reports ,409 .496 

Toy Play .386 .507 .363 

Mealtimes .428 
Explanatory 

Talk .398 

Narrative 
Talk 
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Tables 

Correlations between Children's and Mothers* Rare Word Use at Age-3 and Age'4 Visits and 
Children's Scores on the Etefmitions Task at Age 5 (Pearson f, p <.01) 



Giild Mother 
Conversational Age 3 Age 4 Age 3 Age 4 

Context Visit Visit Visit Visit 



Elicited Reports .449 .353 

Toy Play .323 .343 

Mealtimes 
Explanatory 

Talk .386 

Narrative 
Talk 



From these tables, we see that the use of rare words by mothers and children in toy play 
and elicited rqx)rts is positively associated with both vocabulary measures at age 5. Specifically, 
children's and mothers' use of rare words at age 3 are correlated with the PPVT, while their 
frequencies of rare words at age 4 are correlated with the definitions score for both contexts. 
Children's rare word use at age 4 in toy play is correlated with the PPVT scores as well. 

The child's use of rare words within mealtime explanatory talk at age 3 was also a predictor 
of both vocabulary measures, while children's use of rare words in general mealtime talk at age 4 
correlates with the PPVT, and mothers* use of rare words in mealtime narrative talk at age four 
correlates with the definitions outcomes. 

The most outstanding non-finding in these tables is that there were no correlations between 
the use of rare words in either book reading situation and the vocabulary measures. An 
explanation of why this may be the case follows in the discussion section. 

Discussion 

What have we found out about the effects of exposure of young children to new vocabulary 
in different conversational contexts? First, we have found that not only do different conversational 
contexts generate different proportions of rare words, they also gen^te different rare words as 
well, i.e., 77 percent of the rare words were found in only one context. 

We also have found that different conversational contexts may be more supportive of new 
vocabulary acquisition than others. For example, in both the elicited reports and the mealtime 
narratives, far fewer rare words were used by both mothers and children than in the other contexts, 
perhaps due to the other demands of the task. The use of rare words in these two narrative 
contexts, however, was still predictive of later vocabulary measures. In toy play, both mothers 
and children displayed higher means and ft-equencies, indicating that this may be a highly 
supportive context for the use and uptake of new vocabulary; tWs use was also related to later 
vocabulary measures. In overall m^time conversation, the means and frequencies for mothers' 
and children's use was even higher, although many of these words might well be related to the task 
at hand, i.e., the consumption of broccoli, spaghetti, and cantaloupe. The more potent source of 
new word learning withm mealtimes seems to be during explanatory talk, particularly at age 3, 
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when children, in some explanations, were even asking for definitions of words ("what does *your 
highness* mean?"). 

It is often assumed that reading books to children will help them learn new words. We 
have found that they are indeed exposed to new words (depending on the book), but it is not clear 
that young children learn as much firom book reading as they do from other situations. None of the 
measures of rare vocabulary use in book readings showed any relationship to the vocabulary 
measures. There are a couple of possible explanations for this finding. First, in reading books, 
the vocabulary is dictated more by the book and less by the speakers* abilities than in the other 
conversational contexts. Perh2^)s the rare words used need to be onc^ that the mother or child 
already know in order for them to be used in helpful ways in the conversation during book 
reading. Second, it may be that the use of new words in bookreading hai too little context 
available for a child of three or four to begin the mapping of meaning onto the word, and more 
discussion around the word is needed. In either case, this finding demands a closer analysis of 
bookreading as a word-learning situation. By examining word lists generated without the text of 
the book (which we included in this study), we would be able to tease out some of these issues. If 
discussion of a new word occurs, rather than just hearing the word read in its book context, will 
the child be more likely to expand her vocabulary? And does more lengthy, in-depth discussion 
around a word help more than incidental exposure to the word in discussion about the book? 

Different conversational contexts afford different opportunities to learn new words. 
Because each context varies in the kinds of words and the ways words are used, it appears that 
exposure to vocabulary-rich talk in many settings will allow the greatest opportunity for growth. 
Each setting will teach not only its own vocabulary, but its ways of using those words, according 
to the speakers' purposes for talk within that setting. 
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